Dual Averaging on Compactly-Supported Distributions And Application to No-Regret Learning on a Continuum

نویسنده

  • Walid Krichene
چکیده

We consider an online learning problem on a continuum. A decision maker is given a compact feasible set S, and is faced with the following sequential problem: at iteration t, the decision maker chooses a distribution x ∈ ∆(S), then a loss function ` : S → R+ is revealed, and the decision maker incurs expected loss 〈 `, x 〉 = Es∼x(t) ` (s). We view the problem as an online convex optimization problem on the space ∆(S) of Lebesgue-continnuous distributions on S. We prove a general regret bound for the Dual Averaging method on L(S), then prove that dual averaging with ω-potentials (a class of strongly convex regularizers) achieves sublinear regret when S is uniformly fat (a condition weaker than convexity).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimizing Regret on Reflexive Banach Spaces and Nash Equilibria in Continuous Zero-Sum Games

We study a general adversarial online learning problem, in which we are given a decision set X in a reflexive Banach space X and a sequence of reward vectors in the dual space of X . At each iteration, we choose an action from X , based on the observed sequence of previous rewards. Our goal is to minimize regret. Using results from infinite dimensional convex analysis, we generalize the method ...

متن کامل

Minimizing Regret on Reflexive Banach Spaces and Learning Nash Equilibria in Continuous Zero-Sum Games

We study a general version of the adversarial online learning problem. We are given a decision set X in a reflexive Banach space X and a sequence of reward vectors in the dual space of X . At each iteration, we choose an action from X , based on the observed sequence of previous rewards. Our goal is to minimize regret, defined as the gap between the realized reward and the reward of the best fi...

متن کامل

Collocation Method using Compactly Supported Radial Basis Function for Solving Volterra's Population Model

‎In this paper‎, ‎indirect collocation approach based on compactly supported radial basis function (CSRBF) is applied for solving Volterra's population model. The method reduces the solution of this problem to the solution of a system of algebraic equations‎. ‎Volterra's model is a non-linear integro-differential equation where the integral term represents the effect of toxin‎. ‎To solve the pr...

متن کامل

A survey of Algorithms and Analysis for Adaptive Online Learning

We present tools for the analysis of Follow-The-Regularized-Leader (FTRL), Dual Averaging, and Mirror Descent algorithms when the regularizer (equivalently, proxfunction or learning rate schedule) is chosen adaptively based on the data. Adaptivity can be used to prove regret bounds that hold on every round, and also allows for data-dependent regret bounds as in AdaGrad-style algorithms (e.g., O...

متن کامل

The Hedge Algorithm on a Continuum

We consider an online optimization problem on a compact subset S ⊂ R (not necessarily convex), in which a decision maker chooses, at each iteration t, a probability distribution x over S, and seeks to minimize a cumulative expected loss, ∑T t=1 Es∼x(t) [`(s)], where ` is a Lipschitz loss function revealed at the end of iteration t. Building on previous work, we propose a generalized Hedge algor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1504.07720  شماره 

صفحات  -

تاریخ انتشار 2015